Dataset Article | Open Access
Volume 2025 |Article ID 100084 | https://doi.org/10.1016/j.plaphe.2025.100084

The Global Wheat Full Semantic Organ Segmentation (GWFSS) dataset

Zijian Wang,1,16 Radek Zenkl,6,16 Latifa Greche,11,16 Benoit De Solan,1 Lucas Bernigaud Samatan,1,17 Safaa Ouahid,2,15 Andrea Visioni,2 Carlos A. Robles-Zazueta,3,18 Francisco Pinto,3,19 Ivan Perez-Olivera,3,20 Matthew P. Reynolds,3 Chen Zhu,4 Shouyang Liu,4 Marie-Pia D'argaignon,5,21 Raul Lopez-Lozano,5 Marie Weiss,5 Afef Marzougui,6 Lukas Roth,6 Sebastien Dandrifosse,7,22 Alexis Carlier,7,23 Benjamin Dumont,7 Benoît Mercatoris,7 Javier Fernandez,8 Scott Chapman,8 Keyhan Najafian,9 Ian Stavness,9 Haozhou Wang,10 Wei Guo,10 Nicolas Virlet,11 Malcolm J. Hawkesford,11 Zhi Chen,12 Etienne David,13 Joss Gillet,14 Kamran Irfan,14 Alexis Comar,14 and Andreas Hund 6

1Arvalis, France
2International Center for Agricultural Research in the Dry Areas, Rabat, Morocco
3Global Wheat Program, International Maize and Wheat Improvement Center, C.P. 56237, El Batan, Texcoco, Mexico
4Nanjing Agricultural University, China
5EMMAH, UMR1114, LPA CAPTE, INRAE, Avignon, France
6Department of Environmental Systems Science, ETH Zurich, 8092, Zurich, Switzerland
7Plant Sciences & Biosystems Dynamics and Exchanges, TERRA Teaching and Research Centre, Gembloux Agro-Bio Tech, University of Liege, Gembloux, Belgium
8School of Agriculture and Food Sustainability, The University of Queensland, Brisbane, Australia
9Department of Computer Science, University of Saskatchewan, Saskatoon, Canada
10Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
11Sustainable Soils and Crops, Rothamsted Research, West Common, Harpenden, AL5 2JQ, UK
12School of Electrical Engineering and Computer Science, The University of Queensland, Brisbane, Australia
13neoBloom, Munstermannskamp 1, 21335 Lüneburg, Germany
14HIPHEN SAS, 120 rue Jean Dausset, 84140 Avignon, France
15Programa de Doctorado de Ingeniería Agraria, Alimentaria, Forestal y del Desarrollo Rural Sostenible, Universidad de Cordoba, Cordoba, Spain
16These authors contributed equally to this work
17Current address: University of Toulouse, INRAE, UMR DYNAFOR, 31326 Castanet-Tolosan, France.
18Current address: Department of Plant Breeding, Hochschule Geisenheim University, 65366 Geisenheim, Germany.
19Current address: Centre for Crop Systems Analysis, Wageningen University & Research, Bornsesteeg 48, Building 109, 6708 PE Wageningen, The Netherlands.
20Current address: Department of Agricultural and Biosystems Engineering, South Dakota State University, P. O. Box 57007, Brookings, SD, USA.
21Current address: Limagrain Chappes - Centre de Recherche; 28 Rte d’Ennezat 63720 Clermont-Ferrant.
22Current address: Walloon Agricultural Research Centre, Gembloux, Belgium.
23Current address: Osiris Agriculture, France.

Received 
11 Mar 2025
Accepted 
08 Jun 2025
Published
06 Aug 2025

Abstract

Computer vision is increasingly used in farmers' fields and agricultural experiments to quantify important traits. Imaging setups with a sub-millimeter ground sampling distance enable the detection and tracking of plant features, including size, shape, and colour. Although today's AI-driven foundation models segment almost any object in an image, they still fail for complex plant canopies. To improve model performance, the global wheat dataset consortium assembled a diverse set of images from experiments around the globe. After the head detection dataset (GWHD), the new dataset targets a full semantic segmentation (GWFSS) of organs (leaves, stems and spikes) covering all developmental stages. Images were collected by 11 institutions using a wide range of imaging setups. Two datasets are provided: i) a set of 1096 diverse images in which all organs were labelled at the pixel level, and (ii) a dataset of 52,078 images without annotations available for additional training. The labelled set was used to train segmentation models based on DeepLabV3Plus and Segformer. Our Segformer model performed slightly better than DeepLabV3Plus with a mIOU for leaves and spikes of ca. 90 %. However, the precision for stems with 54 % was rather lower. The major advantages over published models are: i) the exclusion of weeds from the wheat canopy, ii) the detection of all wheat features including necrotic and senescent tissues and its separation from crop residues. This facilitates further development in classifying healthy vs. unhealthy tissue to address the increasing need for accurate quantification of senescence and diseases in wheat canopies.

© 2019-2023   Plant Phenomics. All rights Reserved.  ISSN 2643-6515.

Back to top